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ABSTRACT 

Riboswitch RNAs fold into complex tertiary struc- 
tures upon binding to their cognate ligand. Ligand 
recognition is accomplished by key residues in the 
binding pocket. In addition, it often crucially depends 
on the stability of peripheral structural elements. 
The ligand-bound complex of the guanine-sensing 
riboswitch from Bacillus subtilis, for example, is 
stabilized by extensive interactions between apical 
loop regions of the aptamer domain. Previously, we 
have shown that destabilization of this tertiary loop- 
loop interaction abrogates ligand binding of the 
G37A/C61U-mutant aptamer domain (Gsw loop ) in 
the absence of Mg 2+ . However, if Mg 2 is available, 
ligand-binding capability is restored by a population 
shift of the ground-state RNA ensemble toward RNA 
conformations with pre-formed loop-loop inter- 
actions. Here, we characterize the striking influence 
of long-range tertiary structure on RNA folding kin- 
etics and on ligand-bound complex structure, both 
by X-ray crystallography and time-resolved NMR. 
The X-ray structure of the ligand-bound complex 
reveals that the global architecture is almost identi- 
cal to the wild-type aptamer domain. The population 
of ligand-binding competent conformations in the 
ground-state ensemble of Gsw loop is tunable through 
variation of the Mg 2 concentration. We quantita- 
tively describe the influence of distinct Mg 2+ con- 
centrations on ligand-induced folding trajectories 
both by equilibrium and time-resolved NMR spec- 
troscopy at single-residue resolution. 



INTRODUCTION 

RNA function critically depends on RNA structure and 
folding; and both, RNA structure and folding can be sig- 
nificantly modulated by mono- and divalent cations (1-3). 
RNA folding kinetics are markedly influenced by the 
ground-state conformational RNA ensemble, and 
folding traps sequestering the folding-competent ensemble 
of functional RNAs are frequently encountered. The de- 
pendence of the rate of folding on the ground-state con- 
formational ensemble is significantly more pronounced for 
RNA compared to proteins, since substantial barriers for 
inter-conversion exist between substates of RNAs with 
alternative base pairing schemes even if these substates 
can have almost identical stability (4). 

Riboswitches, a class of m-acting RNA regulatory 
elements, bind specific metabolites with high affinity and 
selectivity. Metabolite binding induces a substantial allo- 
steric conformational rearrangement. Riboswitches, there- 
fore, represent a particularly striking example for the 
coupling of ground-state conformational characteristics 
and the rate of productive ligand-induced RNA folding. 
The conformational switch is key to the RNA-based gene 
regulation mechanism and involves folding of the RNA 
into one of possible alternative structures (5,6). Ligand- 
binding changes long-range interactions in the complex 
riboswitch fold in response to local conformational 
changes. These conformational changes induced by 
ligand binding trigger signal transduction between differ- 
ent RNA domains. 

For transcription-regulating riboswitches, several 
folding events and their rates are coupled to RNA-based 
regulation. Depending on ligand concentration, the rate of 
folding of the metabolite-sensing RNA aptamer domain, 
the ligand-binding event itself and the subsequently 
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Figure 1. Crystal structure of the G37A/C61U-mutant (Gsw oop ) from the B. subtilis guanine-sensing riboswitch aptamer domain in complex with the 
ligand thioguanine (thioG); (a) ribbon representation of the 3D structure of the Gsw loop -thioG complex (chain a). The ligand thioG is presented in 
yellow dots; helices PI, P2 and P3 are color coded in blue, ligand-binding core region and loop regions are color coded in red; cobalt hexamine ions 
are shown in yellow stick representation; (b) Close-up view (rotated by 90° along vertical axis) of tertiary loop-loop interactions (RNA backbone of 
residues 32-38 in L2 and 60-66 in L3 are given; gray: chain a, blue: chain b). Residues of inter-helical base quadruples are shown in stick 
representation (arrows indicate 5'- to 3'-helix direction); (c) stick representation of the inter-helical base quadruple, including the mutations; 
black dashed lines represent hydrogen bonds, mutated inter-base pair angle is annotated in gray [U34(02)-A65(N6)-U61(02) ~117°]; 
(d) close-up view of ligand-binding region from crystal structure of Gsw locp -thioG. Local heterogeneity can be observed in molecules of one 
asymmetric unit in agreement with NMR spectra (Supplementary Figure S2) (gray: chain a, blue: chain b, with respective atom distances annotated). 



induced allosteric conformational rearrangement of the 
RNA regulatory structural element need to proceed in a 
time window suitable for interference with mRNA synthe- 
sis. Intrinsically coupled to the regulatory structural 
rearrangement is the ability of the aptamer domain to 
adopt a ligand-binding competent conformation. This 
ability constitutes a coupled pre-equilibrium to the regu- 
latory conformational event. 

Various X-ray structures of ligand-bound riboswitch 
aptamer domains illustrate a variety of different RNA 
molecular recognition motifs that achieve specific metab- 
olite binding (7-9). Bivalent cations, in particular Mg 2+ , 
often stabilize the 3D architecture of riboswitches and 
ligand binding can in fact even critically depend on 
them. In addition, X-ray and biochemical studies illustrate 
that the global architectures of a number of different 
riboswitch elements is stabilized by long-range tertiary 
RNA-RNA interactions. In numerous cases, these 
long-range tertiary interactions are relevant for riboswitch 
function and/or ligand binding (9). Thus, in addition to 
ligand concentration, structural organization of long- 
range tertiary elements in the aptamer domains and the 
Mg 2+ dependence of their formation can affect the kinetics 
of ligand binding and further riboswitch regulation. 

In order to correlate ground-state ensemble character- 
istics defined by highly conserved residues forming 
long-range interactions with RNA folding kinetics, we in- 
vestigate here the structure of the G37A/C61U-mutant of 
the aptamer domain of the guanine-sensing riboswitch 



RNA from Bacillus subtilis by X-ray crystallography and 
static NMR spectroscopy. Additionally, we use time- 
resolved NMR spectroscopy to observe the ligand- 
induced folding kinetics and decipher how pre-formation 
and stabilization of structural elements affect the course of 
subsequent tertiary folding for guanine-sensing riboswitch 
aptamer domains. 

In the ligand-bound form, the overall fold of the 
aptamer domain consists of three helical segments 
(P1-P3) (Figure 1 and Supplementary Figure SI) whose 
relative orientations are determined by a large number of 
tertiary interactions in the ligand-binding pocket as well as 
tertiary long-range loop-loop interactions (10,11). 
Stabilization of the loop-loop interaction leads to an 
almost parallel compact orientation of the two helices 
P2 and P3 that persist in the ligand-free state of the 
wild-type aptamer domain. As a result of the helical com- 
pactness, electrostatic repulsive interactions accumulate 
that need to be compensated by cations. The formation 
of the long-range loop-loop interactions sensitively 
depends on the sequence of interacting nucleotides 
and can therefore be disrupted even by conservative 
mutations. Previously, we could show by site-directed mu- 
tagenesis comparing the binding characteristics for the 
wild-type aptamer domain and a G37A/C61U-mutant 
that the stable formation of the loop-loop interaction is 
not a pre-requisite for specific ligand binding (12). The 
mutant Gsw loop is substantially more dynamic; however, 
both long-range structure formation and ligand binding in 
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the core region can be restored and even induced inde- 
pendently as a function of the [RNA]:[Mg 2+ ] concentra- 
tion ratio. Due to the possibility to resolve distinct 
conformational transitions by a combination of static 
and time-resolved NMR experiments, we are able to 
dissect ground-state conformational ensemble characteris- 
tics and relative contributions of different structural 
elements with the ligand-induced RNA folding rates. 
These variations of ground-state structural RNA 
ensemble characteristics yield insight in RNA structures 
and in their populations. 

Thus, the mutant Gsw loop represents an optimal system 
to determine how the 3D architecture of the guanine- 
sensing riboswitch is linked to its function and which 
processes determine the time scale of productive RNA 
folding and thus modulate transcriptional regulation. 
Next to structural modifications of ligand functional 
groups that were shown to be correlated with binding 
affinities (13), characterizing and potentially modulating 
the ligand-induced RNA folding processes is vital to 
understand this dynamic regulation processes and import- 
ant for potential applications of these RNA modules. 

MATERIALS AND METHODS 

RNA synthesis 

G37A/C61U-mutant aptamer domains of the guanine- 
sensing riboswitch of the B. subtilis xpt-pbuX operon 
were prepared by in vitro transcription as described 
before (12) (see Supplementary Data and Supplementary 
Figure SI for RNA construct details). 15 N-labeled rNTPs 
were purchased from Silantes (Munich). NMR samples 
were prepared in H 2 0/D 2 0 (9:1) using the following 
buffer conditions: 25 mM potassium phosphate, pH~6.2, 
50 mM potassium chloride. 

RNA crystallization and X-ray data collection 

G37A/C61U-mutant RNA was concentrated to 450 uM in 
a buffer solution composed of lOmM K + -HEPES buffer 
(pH~7.5), 750 uM ligand thioguanine (thioG) and 5mM 
[Co(NH 3 ) 6 ] 3+ . Crystallization trials were performed using 
the hanging-drop vapor-diffusion method where 1 ul of 
the RNA solution was mixed with 1 ul of reservoir 
solution [5mM K + -HEPES (pH~7.5), 12mM 
[Co(NH 3 ) 6 ] 3+ , 25% polyethylene glycol 4000 and 
450 mM ammonium acetate) and incubated at room tem- 
perature. RNA-thioG crystals grew within 1 day and were 
flash-frozen in liquid nitrogen after adding 30% 
2-methyl-2,4-pentanediol for cryoprotection. Diffraction 
data at 2.5 A resolution were collected at beamline PXII 
of the Swiss Light Source (Villigen, Switzerland). Data 
were processed using the XDS package (14). Unit-cell par- 
ameters are a = b= 52.3 A, c = 263.4 A and a = P = 90°, 
y = 120°. 

X-ray structure determination and refinement 

The X-ray structure was determined by molecular replace- 
ment with the program PHASER (15) using the structure 
of the B. subtilis xpt-pbuX guanine-sensing riboswitch 



RNA in complex with hypoxanthine [pdb code: 1U8D 
(10)] as the search model and assuming space group P3 2 
(No. 145). Initial attempts using, for example, the 
apparent space group P3 2 21 had failed. Indeed tests for 
twinning [PHENIX (16) and the twinning server (17)] 
revealed nearly perfect merohedral twinning of the 
crystal (twinning fraction 0.466, twin-law -h,-k,l). The 
MR model was refined with PHENIX (16) and CNS 

(18) using non-cry stallographic symmetry restraints and 
the twinning parameters. The i? tree -factor was calculated 
using 5% of the data not used in the refinement. Manual 
model adjustment, mutation of residues and placement of 
thioG and of [Co(NH 3 ) 6 ] 3+ were carried out using COOT 

(19) . In total 26 [Co(NH 3 ) 6 ] 3+ and 12 water molecules 
were identified on the basis of the electron density maps. 
The R- and i? tree -factors for the final structure were 21% 
and 23%, respectively. X-ray data collection and refine- 
ment statistics are summarized in Supplementary Data 
and Supplementary Table SI. The RNA structures 
related by non-crystallographic symmetry are very 
similar. Therefore, the structure analysis is based on 
chain a. PyMol was used for calculating r.m.s.d. and for 
preparing figures. Structural analysis was performed using 
3DNA (20). 

NMR spectroscopy 

NMR experiments were recorded on a Bruker NMR spec- 
trometer AV700MHz with a 5-mm z-axis gradient 
TXI-HCN cryogenic probe using a standard 
'H, 15 N-HSQC pulse sequence at different temperatures 
(273-293 K) and 'H^H-NOESY and 'H^H^N-filter, 
co 2 }-NOESY pulse sequences (sample conditions: uniform- 
ly 15 N-labeled RNA and unlabeled ligand). 

Time-resolved NMR spectroscopy 

Kinetic NMR experiments were recorded on a 
AV700 MHz Bruker NMR spectrometer equipped with a 
5-mm z-axis gradient TXI-HCN cryogenic probe at 283 K. 
The in situ initiation of the hypoxanthine- and/or Mg 2+ - 
induced RNA folding processes was realized by using a 
rapid mixing device (21). Initially, a solution with a 
volume of 300 ul was placed in a NMR shigemi 
tube. The injection solution (cofactor) had a volume of 
40 ul. The [RNA]: [ligand] ratios were ~1:1 and Mg 2+ con- 
centrations were specified in equivalents (eq) with refer- 
ence to the respective final RNA concentration 
([RNA] abs ~300-700 uM). The kinetic experiments were 
recorded with selectively 15 A^-uridine-labeled RNA in con- 
junction with NMR filter experiments and analyzed as 
described, using the software TOPSPIN 1.3, felix2000 
(MSI) and SigmaPlot 9.0 (22). Supplementary Figure S3 
shows the appropriate nucleotide-specific NMR spectra of 
Gsw loop prior to and after injection of the ligand hypo- 
xanthine. Individual time constants of ligand-induced 
RNA folding were analyzed in time-resolved NMR ex- 
periments at an RNA concentration of 
[Gsw loop ]~670 uM and an [RNA]: [ligand] :[Mg 2+ ] ratio of 
~1:1:8 with a time resolution of ~3.4s/data point. The 
Mg 2+ dependence of ligand-induced RNA folding of 
Gsw loop was analyzed by determination of overall time 
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constants (/r 0 veraii [s -1 ]) for the individual kinetics at each 
[RNA]:[Mg 2+ ] ratio. The overall time constant A: oveia n was 
calculated from the sum of normalized integrals over time 
of individual signals, respectively. The kinetic data were 
fitted with a monoexponential function, half-life values 
were obtained using the formula for a first-order process 
{t\j2 = \n2jk). The error, stated for k, is the fitting error. 
The error value for t xj2 results from error propagation 
(Supplementary Data and Supplementary Table S2). 
Specifically, the result with the best accuracy could be 
obtained by integrating imino proton signals (40%) 
using all data points of the kinetics [as also performed 
for the wild-type RNA (22)]. For an analysis of the 
effects of different processing methods, we refer to 
Supplementary Table S3. 

RESULTS 

In order to dissect how pre-formation and stabilization of 
structural elements and conformational dynamics in the 
ligand-free state of the RNA ensemble affect the course 
of ligand-induced RNA folding and how compactness of 
RNA-ligand structure is linked to the energetic barriers of 
RNA refolding, we investigated the G37A/C61U-mutant 
(Gsw loop ) of the guanine-sensing riboswitch aptamer 
domain of the B. subtilis xpt-pbuX operon. RNA folding 
involves formation of hydrogen-bonding interactions in 
two structural regions of the aptamer domain: the 
long-range inter-helical structure and the ligand-binding 
core region. The mutations were designed in such a way 
that long-range interactions are weakened but can still 
form. However, in contrast to the wild-type guanine- 
sensing riboswitch aptamer domain, multivalent 
(bivalent) cations are crucial for ligand binding and dif- 
ferent [RNA]:[Mg 2+ ] concentration regimes could be 
delineated which differ in the stabilization of long-range 
tertiary structure and in the ligand-binding characteristics 
for Gsw loop (12). At intermediate [RNA]:[Mg 2+ ] concen- 
trations, Gsw loop binds hypoxanthine even without 
pre-formed loop-loop interaction, while at high 
[RNA]:[Mg 2+ ] concentrations, the loop-loop interaction 
is restored in the ligand-free RNA conformation (12). 

Crystal structure of the G37A/C61U-mutant reveals a 
compact fold in the ligand-bound conformation 

Crystals of the Gsw loop -thioG complex belong to space 
group P32 and diffract up to 2.5 A. The asymmetric unit 
contains four non-identical RNA chains (a-d) with each 
RNA chain bound to one thioG molecule. The four 
complexes per unit cell o are pair-wise o virtually identical, 
with r.m.s.d. of ~0.21 A and ~0.22A between chain a 
and d and between chain b and c, respectively, and an 
r.m.s.d. of ~0.53 A between chain a and chain b. A total 
of 26 [Co(NH 3 ) 6 ] 3+ ions were identified per asymmetric 
unit, with seven ions bound to chains a and d, respectively, 
five ions bound to chains b and c and two [Co(NH 3 ) 6 ] + 
ions which cannot be specifically assigned. 

Next to the smaller number of [Co(NH 3 ) 6 ] 3+ -binding 
sites found for chains b and c, these two complexes 
display a slightly altered position of the ligand thioG in 



the binding pocket (Figure 1). The distance of the ligand 
Watson-Crick (WC)-site to the C74 WC site is reduced by 
~0.2A (averaged over all three hydrogen bonds) 
compared to the complexes of chains a and d. In 
contrast, in chains b and c, the ligand o sugar-edge 
distance to the U51 WC site is larger by ~0.9 A, rendering 
hydrogen-bonding interactions between these nucleobases 
unlikely. Interestingly, this flexibility of the ligand position 
within the binding pocket is also observed by NMR spec- 
troscopy (Supplementary Data and Supplementary 
Figure S2) and was previously proposed based on obser- 
vations of ultrafast fluorescence spectroscopy. In this 
fluorescence study, alternative ligand positions were 
assigned to different possibilities of favorable stacking 
interactions in the sandwich-like binding pocket (23). 
Additionally, previous X-ray studies revealed a certain 
degree of flexibility by C74 upon binding to ligand 
analogs with modifications at their six-position (24). Due 
to the shift of thioG in the binding pocket of chain b 
versus chain a, the thioG-C74 interface is also affected 
and adopts a sheared conformation in the RNA chain 
b-thioG complex (Figure 1; Supplementary Data and 
Supplementary Figure S2). As the binding pocket struc- 
ture of the complexes formed by RNA chains a and d 
represent the consensus ligand-bound form of the 
wild-type aptamer complex most closely (10,11), we will 
focus in the following on the structure of the RNA chain 
a-thioG complex. 

The overall fold of the Gsw loop -thioG complex is very 
similar to the wild-type aptamer domain bound to the 
ligand hypoxanthine (10) (r.m.s.d. ~0.85A) and 
purine-sensing riboswitches in general (11,25-27). All 
key secondary and tertiary structural features are main- 
tained by the mutant aptamer domain, including the 
parallel arrangement of helices P2 and P3 that is stabilized 
by tertiary loop-loop interactions between L2 and L3 
(Figure 1). Specifically, the long-range interactions are 
formed by residues G38 and A37 of loop L2 involved in 
WC base pairs with C60 and U61 of loop L3 and A33 and 
U34 of loop L2 forming non-canonical Hoogsteen base 
pairs to A66 and A65 of loop L3. Thus, introducing 
the conservative double mutation G37A/C61U results in 
the same loop-loop interaction architecture as in the 
wild-type, although local differences are observed in the 
crystal structure of the Gsw loop -thioG complex. First of 
all, the backbone morphology of the loop-loop inter- 
action as represented by the positions of the phosphorous 
atoms of residues 32-38, in L2, and residues 60-66, in L3, 
displays an r.m.s.d. of only ~0.6 A to the corresponding 
wild-type sequence. The altered nucleotide sequence in the 
mutant results in formation of less inter- and intra-helical 
loop-loop hydrogen bonds. The WC base pair A37-U61 
naturally has only two hydrogen bonds compared to three 
in case of the wild-type G37-C61 base pair. Furthermore, 
since the exocyclic amino group at position 2 of A37 is 
missing, a hydrogen bond to U34 cannot form. This 
lacking hydrogen bond leads to a different orientation of 
the Hoogsteen base pair U34-A65 with respect to base 
pair A37-U61 {inter-base pair angle [U34(02)- 
A65(N6)-U61(02)] -117°} when compared to the wild- 
type, for which these base pairs are co-planar. 
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Interestingly, the double mutation induces mostly 
indirect effects in the context of the overall tertiary struc- 
ture of the complex. The base quadruple containing the 
mutated base pair is mainly distorted at residues U34 and 
A65 versus the wild-type structure. The U34-A65 base 
pair is staggered by ~0.97 A in the mutant, thereby weak- 
ening inter-base quadruple hydrogen bonding. This weak- 
ening is also observed by NMR spectroscopy in solution. 
The signal intensity of the U34 imino proton resonance is 
substantially decreased in the 'H, 15 N-HSQC (12). The 
second base quadruple stabilizing the loop-loop inter- 
action consists of the WC base pair G38-C60 and the 
Hoogsteen base pair A33-A66. Here, the WC base pair 
stacking below the mutated A37-U61 base pair is 
characterized by a significant propeller twist of ~— 24° 
compared to ~— 11° in the wild-type structure, while the 
Hoogsteen base pair is buckled by ~— 18° versus ~— 9° in 
the wild-type. Additionally, the double mutation leads to a 
reduced stability of the closing base pair A59-U67 of P3. 
The imino proton-mediated hydrogen bond distance is 
stretched by ~0.4A compared to the wild-type base 
pair, which is likewise observable by a weak signal inten- 
sity of imino proton signal U67 in the 'H, 15 N-HSQC (12). 
In summary, the stability of the loop-loop interaction in 
the Gsw loop -thioG complex is reduced by the loss of 
several hydrogen bonds and favorable stacking inter- 
actions relative to the wild-type, which is also reflected 
in thermal melting curves (12). 

As a side remark, the crystal structure of the Gsw loop - 
thioG complex also illustrates that the P2 stabilizing mu- 
tations introduced by us in NMR studies concerning the 
guanine-sensing riboswitch aptamer (12,22,28,29) has no 
influence on the tertiary fold of the final RNA-ligand 
complex as judged by a backbone r.m.s.d. of only 
~0.61A compared to the wild-type sequence 
(Supplementary Figure SI). 

Formation of ligand-binding functional states of Gsw loop 

The influence of Mg 2+ on overall compaction of RNA and 
electrostatic compensation of negative charges has been 
shown for numerous RNAs. In agreement with previously 
observed Mg 2+ -induced chemical shift perturbations 
(CSP) for several imino proton signals of (i) Gsw loop and 
(ii) Gsw apt by solution NMR (12,28), cobalt hexamine 
ions could also be identified in the crystal structure of 
the Gsw loop -thioG complex as illustrated in Figure 2a. 
We compared the [Co(NH 3 ) 6 ] 3+ -binding sites identified 
in the crystal structure to the RNA residues displaying 
significant NMR CSPs in Mg 2+ titration experiments, (i) 
In comparison to NMR studies of Gsw loop , all residues 
except for U26 and U34 that experience CSPs of > 15 Hz 
in the Mg 2+ titration experiments are also in close prox- 
imity to a bound [Co(NH 3 ) 6 ] 3+ ion in the crystal structure, 
supporting the role of [Co(NH 3 ) 6 ] 3+ as a Mg 2+ outer-shell 
hydration mimic and likely excluding any essential role of 
inner-shell coordinated Mg 2+ -ions for folding of this 
riboswitch RNA (Figure 2a). (ii) Comparison of the 
Gsw loop -thioG crystal structure with both the wild-type 
crystal structure bound to hypoxanthine [pdb: 1U8D 
(10)] and Mg 2+ -dependent NMR titration studies of 



Gsw apt (28) revealed that exclusively G72 seems to be a 
unique Mg 2+ -binding site for the wild-type RNA-ligand 
complex (Figure 2b). This imino proton shows no signifi- 
cant Mg 2+ -induced CSPs by NMR in the mutant aptamer 
domain, while for all other residues, CSPs can be mapped 
to the same structural regions both in Gsw loop (Figure 2a) 
and in Gsw apt (Figure 2b). 

As one example, we further characterized the effect of 
Mg 2+ by inspection of NMR spectral characteristics of the 
imino proton signal of nucleotide U81 localized in helix 
PI, for which a cobalt hexamine ion could be observed 
in close spatial proximity within the crystal structure 
(Figure 2). At a [Gsw loop ]:[Mg 2+ ] ratio of -1:7, hypoxan- 
thine binds to Gsw loop ; as a consequence of this binding 
process, the chemical shift of U81 changes by only 
A8~8.2Hz (Figure 2d). In contrast, Mg 2+ binding to the 
RNA induces strong chemical shift changes of this 
signal. While a [Gsw Kop ]:[Mg 2+ ] ratio of -1:7 induces a 
chemical shift change of A5( 0 _7 e q)~24.6Hz, at a ratio of 
[Gsw loop ]:[Mg 2+ ]~ 1:20 a difference of A5 (0 _ 2 oeq)~49.1 Hz 
is observed (Figure 2e). These NMR data illustrate that 
Mg + has a strong effect on the conformational ensemble 
characteristics of the free RNA, even before the RNA 
shows well-established long-range tertiary interactions 
(12). Time-resolved NMR experiments following spectral 
changes of the imino proton signal of nucleotide U81 over 
time reveal the kinetics of the respective Mg 2+ -induced 
conformational changes of Gsw oop to be faster than 
~8-10s (Figure 2c), in line with a rapid structural 
collapse induced by Mg 2+ binding. 

Ground-state conformational dynamics affect the time 
course of ligand-induced RNA folding 

The mutant riboswitch Gsw oop was designed to probe the 
importance of the long-range loop-loop interaction on 
ligand-induced folding rates and involved conformational 
states of the guanine-sensing riboswitch aptamer domain. 
Since pre-organization of the tertiary loop-loop inter- 
action in the ligand-free state of Gsw oop critically 
depends on the Mg 2+ concentration, time-resolved NMR 
experiments of ligand-induced RNA folding were per- 
formed at various Mg 2+ concentrations in order to inves- 
tigate whether differences in the RNA folding rates can be 
observed (i) for different structural elements as previously 
detected for the wild-type Gsw apt (12) and for other 
riboswitch aptamer domains, as e.g. the TPP-sensing 
riboswitch (30) and (ii) to test the influence of 
ground-state conformational dynamics on ligand-induced 
riboswitch folding. 

Probing the impact of native tertiary structure pre- 
organization on ligand-induced RNA folding. We 
analyzed the ligand-induced RNA folding kinetics at a 
[RNA]:[Mg 2+ ] ratio (-1:8), for which static NMR 
studies demonstrated ligand-binding competence but no 
stable pre-formation of long-range tertiary loop-loop 
interactions (12). Hypoxanthine-induced RNA folding 
could be monitored for 12 resolved imino proton 
reporter signals (Supplementary Data, Supplementary 
Figure S3 and Supplementary Table S2). Representative 



Nucleic Acids Research, 2011, Vol. 39, No. 22 9773 




Figure 2. Hypoxanthine- and Mg 2+ -induced effects on conformation and folding of Gsw loop . (a) Mg 2+ binding by the Gsw loop -thioG complex (pdb: 
3RKF). RNA residues of Gsw loop (sequence position/structural element (77)/Pl, (26, 30)/P2, (55, 56, 67)/P3), (34, 61, 38)/loop region, (47)/binding 
pocket that show NMR imino proton chemical shift changes upon Mg 2+ titration A5 (0 -33 eq )>15Hz (12) are annotated in stick representation on the 
crystal structure of the G37A/C61U-mutant; cobalt hexamine ions are found in close proximity to the respective residues in the crystal structure 
(cobalt hexamine ions are highlighted in yellow); (b) Mg 2+ binding by the Gsw ap '-hypoxanthine complex [pdb: 1U8D (10)]. RNA residues of Gsw apl 
(sequence position/structural element (77)/Pl, (26, 30, 31)/P2, (55, 56, 67, 72)/P3), (37, 38)/loop region, (47)/binding pocket that show NMR imino 
proton chemical shift changes upon Mg 2+ titration AS (0 _ 33eq )>15Hz (28) are annotated in stick representation on the crystal structure of the 
wild-type Gsw 1<pl -hypoxanthine complex. Cobalt hexamine ions are found in close proximity to the respective residues in the crystal structure 
(cobalt hexamine ions are highlighted in orange); (c) kinetics of Mg 2+ - and hypoxanthine-induced RNA-ligand complex formation. Spectral 
changes recorded for imino proton signal U81 over time [s] are illustrated (*): time point of injection (;~0s) of Mg 2+ and hypoxanthine; time 
resolution/ID spectrum ~8.6s; black: ID-spectra before injection, red: ID-spectra following injection); (d) addition of hypoxanthine leads to small 
chemical shift changes of the imino proton signal of nucleotide U81 in helix PI (A8~8.2Hz) (overlay of 'H, 15 N-HSQC spectral region of U81/ 
Gsw loop at an [RNA]:[Mg 2+ ] ratio of ~1:7 in presence (red) and absence of hypoxanthine (black)); (e) addition of Mg 2+ leads to significant chemical 
shift changes of U81 (overlay of 'H, 15 N-HSQC spectral region at different [RNA]:[Mg 2+ ] ratios: A8 ( o_ 7eq) ~24.6 Hz and A5 (0 _ 2 oeq)~49.1 Hz). 



reporter signals could be determined for the ligand itself, 
the ligand-binding core region of the RNA, the helical 
elements as well as the tertiary loop-loop interactions. 
Kinetic analysis of the Gsw loop folding process did not 
reveal significant differences in the folding kinetics of in- 
dividual nucleotides for different structural elements 
(Figure 3 and Supplementary Table S2). The kinetic 
traces could be fitted to monoexponential functions and 
the determined half-lives (? 1/2 [s]) were found to vary 
between 20.1s and 25.6 s (Supplementary Table S2). 
Destabilization of long-range tertiary structure of 
Gsw loop and the consequently induced differences in 
ground-state conformational dynamics thus significantly 
alter the ligand-induced RNA folding process compared 
to the wild-type sequence (22). 



Mg 2+ dependence of the kinetic rates of ligand-induced 
RNA folding. We analyzed the Mg 2+ dependence of 
the conformational ensemble characteristics of the 
ligand-free state of Gsw loop on the kinetics of ligand- 
induced RNA folding by time-resolved NMR at 
[RNA]:[Mg 2+ ] ratios between 1:5 and 1:20. As kinetic 
rates did not show significant differences for individual 
nucleotides and could be fitted by monoexponential func- 
tions (Figure 3 and Supplementary Table S2), we 
determined an overall time constant A overall ([s -1 ]) for 
ligand-induced RNA folding at each Mg 2+ concentration. 
Supplementary Figure S4 shows the build-up kinetics at 
three characteristic [RNA]:[Mg 2+ ] ratios, indicating that 
ligand-induced RNA folding is strongly dependent on 
Mg 2+ concentration. 
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Figure 3. Ligand-induced folding of Gsw loop ([RNA]:[ligand]:[Mg 2+ ] ~1:1:8, 700 MHz, 283 K). Normalized integrals over time [s] of exemplary imino 
proton signals of (a) nucleotide U49 within the ligand-binding core region and of (b) nucleotide G38 forming part of the long-range inter-helical base 
pairing interactions (dark gray solid line: monoexponential fit), (c) Secondary structure of the G37A/C61U-mutant of the guanine-sensing riboswitch 
aptamer domain of the B. subtilis xpt-pbuX operon (for further construct details, see Supplementary Data). 



In addition to the influence of Mg concentration on 
the kinetics of ligand-induced folding of Gsw loop , we also 
observe that the populations of the ligand-free to the 
ligand-bound states at equimolar ratio of RNA and 
ligand are influenced by the Mg 2+ concentration (Figure 
4). Analysis of NMR spectra following ligand-induced 
RNA folding support that RNA-ligand complex forma- 
tion is completed at [RNA]:[Mg 2+ ] ~ 1:20. At these experi- 
mental conditions, imino proton signals indicative 
for Gsw loop in its free conformation can no longer 
be detected in the final ligand-bound state 
([RNA]:[ligand]~l:l). In contrast, for [RNA]:[Mg 2+ ] 
ratios <1:20, NMR spectra show signals for the RNA- 
ligand complex as well as for the ligand-free conformation 
of Gsw loop (Figure 4). 

In conclusion, the fit of the rate-dependence fc OV eraii t0 a 
sigmoidal curve (Figure 5a) assumes that the dependence 
on the [Mg 2+ ]:[RNA] ratio approaches saturation at 
[Mg 2+ ]:[RNA] > 20:1. This is in agreement with the obser- 
vation that at high [Mg 2+ ]:[RNA] ratios exclusively the 
ligand-bound state is populated (Figure 4). 

Comparison of the Mg 2+ influence on ground-state 
characteristics and kinetic rates of ligand-induced RNA 
folding for Gsw'°" p and Gsw ap1 . In contrast to the 
wild-type Gsw apt , the mutant Gsw loop RNA requires 
Mg 2+ to rescue ligand-binding capability (12). Minor 
changes in the native RNA sequence result in fundamen- 
tally different folding behavior of the Gsw loop RNA. 
Table 1 provides a comparison of structural characteristics 
and ligand-binding kinetics for Gsw apt (22) and Gsw loop 
for different relevant Mg 2+ concentrations. 

Structural characteristics of the ground-state ensembles 
are substantially affected by Mg 2+ in case of the Gsw loop 
RNA, however not as pronounced for the wild-type 
Gsw apt . Ligand-induced folding of Gsw loop at intermedi- 
ate Mg 2+ concentrations displays a smaller variance of 
nucleotide-specific kinetic rates compared to the wild- 
type. These individual kinetic variations are no longer 
correlated in context of the 3D structure (Supplementary 
Figure S5), consistent with a different folding pathway. 
Starting from a structurally less defined ensemble, due to 
only transiently stabilized tertiary interactions, the 



ligand-bound RNA is comparable to the wild-type 
complex. However, the presence of Mg 2+ further induces 
electrostatic stabilization. Higher Mg 2+ concentrations ac- 
celerate folding beyond time-scales accessible by real-time 
NMR (<10s). 

DISCUSSION 

The restriction of the dynamic flexible conformational 
ensemble of the unfolded state of RNA toward the func- 
tional state is one of the major driving forces during RNA 
folding. The formation of a compact, partially structured 
intermediate state ensemble can restrict the number of dy- 
namical accessible states and guide folding trajectories. 
Long-range tertiary contacts can determine folding rates 
and select for specific folding pathways (31,32). Due to the 
polyanionic character of RNA, electrostatic charge com- 
pensation represents a major barrier to be overcome when 
folding complex tertiary structures. Consequently, 
divalent cations are often essentially required to stabilize 
the functional RNA conformation as well as lowering en- 
ergetic barriers of the RNA folding landscape. 

For riboswitches as a recently identified class of RNA 
regulating elements, understanding the regulation mech- 
anism and its dependence on ligand concentration and 
cofactor requirements is crucial to dissect the different 
factors that are important for the conformational re- 
arrangement. By analyzing the effect of changing Mg 2+ 
concentrations, we can delineate from other potential 
factors present in cells, including RNA chaperones, 
helicases and alike. The remarkable aspect of the free con- 
formation of the guanine-sensing riboswitch aptamer 
domain in the absence of ligand and Mg 2+ is the stably 
formed long-range loop-loop interaction, while nucleo- 
tides in the ligand-binding region are not involved in per- 
sistent base pairing interactions (28). Specific ligand 
binding can already be observed in the absence of Mg 2+ . 
The addition of Mg 2+ leads only to minor chemical shift 
changes of imino proton resonances indicating that Mg 2+ 
does not significantly affect the ground state ensemble of 
the wild-type riboswitch. However, comparison of the 
relative stabilities of the RNA in the presence and 
absence of Mg 2+ by temperature-dependent CD- and 
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Figure 4. Dependence of the equilibrium of free Gsw loop and Gsw loop -hypoxanthine complex on Mg 2+ concentration. Signals originating from 
nucleotide U51 can be detected only in the ligand-bound form, while signals from nucleotide U17 can be observed in the free (U17f ree ) and the RNA- 
ligand complex (U17 comp i ex ), however, with different chemical shifts. The signal of nucleotide U69 shows differences neither in intensity nor signifi- 
cantly in chemical shift in the two conformations, (a) Normalized integrals relative to the integral U17 lol!l i = U17 free + U17 oomp i ex for imino proton 
signal of U17 free! U17 conlpll!x and U51 as a function of the [Mg 2+ ]:[RNA] ratio, (b) Overlay of ID NMR spectra ( 15 N-edited) of Gsw loop ( 15 N-uridine 
labeled) in the presence of equimolar concentrations of hypoxanthine and varying concentrations of Mg 2+ (signal-to-noise is normalized relative to 
signal of nucleotide U69). 
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Figure 5. Mg 2+ dependence of the ligand-induced Gsw loop folding; (a) overall time constant ^overall [s -1 ] of ligand-induced RNA folding process as a 
function of [Mg 2+ ]:[RNA] ratio (the error is given from replica measurement at [Mg 2+ ]:[RNA] ~7:1; additional data at higher [Mg 2+ ]:[RNA] ratios 
>20:1 could not be obtained due to the limited time-resolution of the NMR measurements; gray dashed line: sigmoidal fit; the choice of the fitting 
function was motivated by assuming that £ OV eraii [ s ~ ] approaches saturation at an [RNA]:[Mg 2+ ] ratio of 1:20); (b) free activation energies of the 
Mg 2+ - and/or ligand-induced conformational transitions are schematically depicted. In the free G37A/C61U-mutant (M, absence of Mg 2+ ), second- 
ary structure elements but neither the loop-loop interaction nor the ligand-binding region are pre-formed and Gsw loop cannot bind ligand (28). 
Through variation of the [RNA]:[Mg 2+ ] ratio, the ligand-binding capability can be restored ({I} mg ) as well as formation of the tertiary loop-loop 
interaction at high Mg 2+ concentrations ([RNA]:[Mg 2+ ] >1:18). In the schematic diagram, this observation leads to a conformational RNA ensemble, 
whose dynamic and structural properties are Mg 2+ dependent (signified by {}). The addition of Mg 2+ leads to a rapid RNA conformational change. 
The derivation is based on the experimental finding that the Mg 2+ -induced conformational transition, monitored for Gsw loop at an [RNA]:[Mg 2+ ] 
ratio <1:18 (M^{I} mg ), is faster than ~8-10s. The conformational transitions induced by Mg 2+ are very fast and associated with energetic barriers 
that are small compared to the barriers associated with ligand binding. The kinetics of ligand binding ({I} m g->{C*} mg ) strongly dependent on the 
[RNA]:[Mg 2+ ] ratio. The variation in kinetic rate constants from the [Mg 2+ ] concentration implies Mg" + -dependent differences of the free activation 
energy. 



NMR experiments reveals significant stabilization, espe- 
cially for the tertiary loop-loop interaction in the 
presence of Mg 2+ but also for secondary structure (12). 

Despite pronounced dynamics of the RNA ensemble in 
the ligand-free form of the G37A/C61U mutant studied 
here resulting from introduced mutations, the final 
complex state adopts a WT-like compact overall fold. 
Our crystal structure of the G37A/C61U-mutant RNA 
reveals a ligand-bound conformation for which the 
global architecture but also ligand recognition and forma- 
tion of the binding pocket are almost identical to wild-type 
RNA-ligand complexes. The observed local structural 



differences in the tertiary loop-loop motif are in line 
with structural and stability differences compared to the 
wild-type guanine-sensing aptamer domain (12). The 
Mg 2+ dependence of structure formation observed in 
NMR studies is confirmed by observation of cobalt 
hexamine ions in the crystal structure, supporting the 
need to balance electrostatic repulsion effects for RNA 
structure compactness. 

Our observation of a fast Mg 2+ -induced RNA folding 
process for Gsw loop is in agreement with data for other 
RNAs (33). For the Tetrahymena ribozyme, Mg 2+ - 
induced conformational changes appear at a time scale 
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Table 1. Comparison of RNA structural characteristics of Gsw apl and Gsw loop and kinetic data for ligand-induced folding obtained by 
time-resolved NMR; (T~283 K, ligand: hypoxanthine) 



a Gsw apl without Mg 2 



a Gsw i,pl and Mg 2 



b Gsw loop without 
Mg 2+ 



b Gsw loop and Mg 2+ (1:8) 



b Gsw loop and Mg 2+ (1:33) 



Ligand-binding competent 
Loop-loop interaction 
preformed 



Core signals, fi/2~18.9— 23.6 s 
Loop/helix P2 signals, 



Ligand-binding competent 
Loop-loop interaction 

preformed 
Electrostatic compensation 

by Mg 2+ 
Folding completed <10s 
Folding completed <10s 



No tertiary 
structure 



Ligand-binding competent 
Tertiary structure transiently 
formed 

Electrostatic compensation by 
Mg 2+ 

Core signals, /!/ 2 ~20.1-25.6s 
Loop/helix P2 signals, 
/,p~20.6-24.5s 



Ligand-binding competent 
Loop-loop interaction 

preformed 
Electrostatic compensation 

by Mg 2+ 
folding completed <10s 
Folding completed <10s 



"Structural data published by Noeske et al. (28), kinetic data published by Buck et al. (22). 
b Structural data published by Buck et al. (12). 



of several milliseconds; these conformational changes have 
been interpreted as structural compaction and transient 
formation of native tertiary contacts (34-37). Persistently 
formed loop-loop interactions are not a pre-requisite for 
ligand binding but Mg 2+ exerts additional conformational 
changes beyond the stabilization of tertiary native 
contacts. The conformational changes induced by Mg 2+ 
are essential for formation of functional states but do, as 
observed, not represent rate-limiting steps for ligand 
binding and folding of Gsw loop . 

Comparison of kinetics at high Mg 2+ concentration for 
Gsw apt and Gsw oop show that, once key tertiary structural 
motifs are formed, Mg 2+ facilitates structural rearrange- 
ments and leads to fast RNA folding. This is in agreement 
with the observation that the RNA folding rate mainly 
depends on the stability of discrete folding intermediates 
(38). The sequential folding kinetics observed for the 
wild-type, Gsw apt (22), are a result of hierarchical 
tertiary structure formation, which in turn depends on 
their respective stabilities. Once Mg 2+ is present, such dif- 
ferences in stability are small compared to the large favor- 
able electrostatic contributions offered by Mg 2+ and such 
subtle effects are no longer observed in folding rate 
constants. 

The differences in ligand-induced folding kinetics for 
individual nucleotides previously observed for the 
guanine-sensing riboswitch aptamer domain (22) (in the 
absence of Mg 2+ ) cannot be observed for Gsw loop at an 
[RNA]:[Mg 2+ ] ratio of ~1:8. At this concentration ratio, 
the RNA is able to bind the ligand hypoxanthine despite 
the observation that the tertiary loop-loop interactions 
are not stably formed. Due to comparable time constants 
of all observed imino proton signals of Gsw loop , we 
conclude that the underlying RNA folding processes 
proceed concerted and cooperative. For Gsw loop , concur- 
rent structuring of the ligand-binding core region and for- 
mation of the long-range tertiary loop-loop interaction 
directly leads to a folding trajectory resulting in a prefer- 
ential compact orientation of the global RNA architecture. 
Under experimental conditions for which the tertiary 
loop-loop interactions are not pre-formed, kinetics of 
ligand binding strongly depend on the [RNA]:[Mg 2+ ] 
ratio. As shown in the schematic diagram in Figure 5b, 
this observation is assigned to a conformational RNA 



ensemble, whose dynamic and structural properties are 
Mg 2+ dependent. The variation in kinetic rate constants 
from the Mg 2+ concentration implies Mg 2+ -dependent dif- 
ferences of the free activation energy (Figure 5b). 

For Gsw loop , the first Mg 2+ -induced folding step is not 
strictly coupled to formation of the loop-loop interaction, 
but nevertheless leads to a ligand-binding competent RNA 
ensemble (12). This finding implies that Mg 2+ -induced 
folding represents the first obligatory folding step of the 
RNA aptamer domain during in vivo transcription, 
initiated by free Mg 2+ ; its rate depends on the physiologic- 
ally available Mg + concentration. The conformational 
dynamics of the RNA ensemble stabilized through 
Mg 2+ -ions determine the rate of complex formation of 
the aptamer domain and the cognate ligand. These rates 
are markedly affected for Gsw oop . The most compact 
structures and the fastest kinetics are detected in the 
presence of charge compensating counter ions. Mg 2+ 
leads to electrostatic charge compensation or even the for- 
mation of long-range tertiary loop-loop interactions. For 
Gsw loop , the tertiary loop-loop interactions form only at 
[RNA]:[Mg 2+ ] ratios >1:18, while at lower ratios the RNA 
ensemble is heterogeneous and the kinetics of folding are 
slow. A more heterogeneous RNA ensemble may lead to a 
strong entropic penalty in the ligand-induced folding, a 
factor that would increase the free activation energy of 
the folding reaction. The observed Mg~ + dependence 
suggests an important role for structural pre-organization 
of the free RNA ensemble to drive fast productive 
ligand-induced RNA folding. 

Our time-resolved NMR kinetics of ligand-induced 
RNA folding reveal that the rate of the ligand-binding 
process is sensitively regulated by the exact sequence in 
remote structure elements not involved in ligand binding 
itself and by the Mg + concentration. The folding rate 
correlates with the degree of structural pre-organization 
and the stability of remote RNA-RNA interactions. 
These results are in line with our previous studies, 
showing the Mg 2+ dependence of functional stability 
(12). The guanine-sensing aptamer mutant Gsw loop , for 
which tertiary structure stabilization and the rate of 
ligand binding can be tuned through variation of the 
Mg 2+ concentration might offer an interesting module 
for artificial riboswitch design in synthetic biology. The 
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biological function of the long-range loop-loop inter- 
action might involve the suppression of alternative 
folding pathways. In a primordial scenario reminiscent 
of the 'RNA world', intramolecular RNA-RNA inter- 
actions might serve as a helper motif, a primitive intramo- 
lecular RNA chaperone, to ensure proper folding of the 
overall tertiary scaffold to promote local folding of the 
RNA and thereby enable ligand binding in a biologically 
relevant time window as ultimate regulatory trigger. 
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